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Abstract — This paper proposes a 2-tier data hiding technique 
that involves Exploiting Modification Direction (EMD) method 
and Huffman Coding. Firstly, a secret message of an arbitrary 
plain text is encrypted, compressed and transformed into a 
stream of bits. Subsequently, the bits are converted into secret 
digits by using the Huffman dictionary table. Secondly, a 
cover image is segmented into groups of n pixels and each 
group is embedded with one secret digit by modifying one 
gray-scale value at most to hide the secret digit in (2n+l)-ary 
notational system. The experimental results have shown that 
both PSNR and payload of the proposed method are higher 
than that of well-known methods namely, OPAP, EMD and 
Opt EMD. 

Index Terms — data hiding, steganography, Exploiting 
Modification Direction (EMD) method, Huffman coding, data 
protection, cryptography. 

I. Introduction 

The need for methods that provides efficient work on the 
protection of data and private property of individuals has 
become very vital due to the huge growth of multimedia 
applications on networks. It is therefore important to create 
methods that provide security for the media from thieves and 
hackers to prevent them from tampering and misrepresenting 
the data. 

Data protection consists of two techniques: 
cryptography and data hiding. Cryptography means the 
provision of protection for data storage and data transfer 
while using a secret key. Encryption is still a successful way 
to protect stored and transmitted data over a network. But, 
with the growing use of networks to send and receive data 
on the global information network, it has become very difficult 
to maintain this data. Data hiding has two main approaches: 
steganography and digital watermarking. 

These two approaches have many techniques [2], one of 
them is LSB steganography which is defined as the practice 
of undetectably altering a Work to embed a secret message. 
Also, watermarking is defined as the practice of imperceptibly 
altering a Work to embed a message about that Work [1]. 
Steganography is also defined as the art and science of 
communicating in a way which hides the existence of the 
communication. In contrast to Cryptography, where the 
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enemy is allowed to detect, intercept and modify messages 
without being able to violate certain security premises 
guaranteed by a cryptosystem, the aim of steganography is 
to hide secret information inside other harmless messages to 
ensure nobody except the receiver can detect that there is a 
second message present. 

The steganography system consists of two procedures 
which are: the embedder and a detector. The embedder takes 
two inputs. One is the payload to embed the secret message, 
and the other is the cover Work in which to embed the 
payload. The output of the embedder is typically transmitted 
or recorded. Later, that Work is presented as an input to the 
detector. Most detectors try to determine whether a payload 
is present, and if so, output the message encoded by it [1]. 
Thus stenography is used to protect secret information by 
embedding secret messages in the host media (cover image), 
for instance: text, images, audio or video. 

II. Background of The Problem 

The basics of steganography are inaudibility, robustness 
and data rate (payload). The required trade-offs between the 
image quality and the payload, the steganography algorithm, 
must be at an acceptable level. In steganography, it is very 
hard to embed a large amount of data and preserve the high 
image quality at the same time. Therefore, if it is required to 
have more payloads, the image quality will be low. Conversely, 
a steganography algorithm is usually not efficient with high 
payload embedding [3]. 

Steganography techniques are mostly divided in two 
groups: spatial domain [8, 9, 1 1] and frequency domain [10, 
12, 13, 14]. The former embeds messages directly into the 
image pixels. The Least Significant Bit (LSB) insertion method 
is probably the most well-known image steganography 
technique. This method is easy to implement and has high 
quality but, it is extremely vulnerable to attacks such as image 
manipulation (compression and low pass filtering). 

Contrary to the above method, frequency domain 
technique embeds messages in the frequency coefficients of 
images by using a frequency-oriented mechanism such as 
Discrete Cosine Transform (DCT) [17], Discrete Fourier 
Transform (DFT), Discrete Wavelet Transform (DWT) [17], 
and so on. Generally, embedding in the low frequency can 
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withstand many attacks but provide clear effect on the media. 
In contrast, embedding in the high frequency provide less 
impact on the quality of images but creates low robustness 
to different attacks [16]. 

Now the question is: How could the payload is increased 
while maintaining high image quality? 

III. RELATED WORK 

The LSB is considered one of the most common methods 
[6, 7, 8]. This approach deals directly with a cover-image, 
after hiding a secret image within it. In general, bit-mapped 
images are commonly used. Every image consists of a set of 
pixels, and every pixel represents one colour. The values of a 
gray- scale image range from to 255. If the value of the pixel 
is equal to 0, it signifies darkness, and when is equal to 255, 
it indicates lightness. Therefore, a gray level image can be 
adjusted by adjusting these values. At least 8 bits are required 
to represent these values, and the binary system stores them 
from bits a 1 a 2 . . . a g . The LSB substitution changes the last bit 
(a^ to make imperceptible modification that cannot be 
detected by the human vision system. For instance, if the 
value of a pixel is 100, and we want to embed a 1, the pixel 
value becomes 101. Human vision cannot recognize this 
difference. However, the LSB can easily embed secret data 
into an image with an impalpable effect on the image. The 
pixel values after embedding can be computed using (1). 



x -x.-x. mod 2 k +m. 



(1) 



Where x., and x .refer to the pixel before and after embedding 
respectively, k refers to the number of bits which going to 
embed and m. refers to the value of secret message. Other 
researchers used Intermediate Significant Bit (ISB) planes 
[18, 19, 20, 21, 22, 23] to overcome the LSB drawbacks. Here, 
higher bit-plane is selected to embed the secret messages. 
Thus, it reduces stego-image quality but increases 
robustness. The higher the bit-plane is chosen the better 
robustness would be but the poorer image quality would 
become and vice versa [24]. 

In the Optimal Pixel Adjustment Process (OPAP) proposed 
by [6], this algorithm is effectively reduced distortion. The 
OPAP makes use of the last n bits to embed data and at the 
same time toggles the n+ 1 bit while comparing the toggle 
with least distortion. For example, bit stream "0011" if the 
embedding is"00"; the result is either "0000" or "0100 ". It is 
obvious that changing "00 11" to "0100" will cause less dis- 
tortion in comparison to "0000". Although this technique 
may reduce distortion, it still weak when value of n is more 
than two bits. Thus, the technique cannot handle higher im- 
age quality. 

Ref. [4], proposed the Exploiting Modification Direction 
(EMD) to reduce distortion. EMD made use of n pixels as a 
group to embed secret digits in a (2n+ l)-ary notational 
system. During embedding stage, it requires to increase or 
decrease one from the value of a particular pixel within the 
group using (2) and (3). 
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f(£hg2 gn) = \Zf=i(£i * i)]mod (_2n + 1) (2) 

s = (d - f) mod(2rc + 1) (3 ) 

Where n indicates no. of pixels, gl, g2..., gn represent values 
of pixel within each group, s refers to index of the image, d 
indicates value of secret digit. 

If f= d no change is needed , if/# d and s lesser or equal to 
n, g(J is increase by I. Iff" 6 d and s is greater than n, g( 2n+l _ 
s ) is decrease by 1. 

For example, let n=4, d=3 and the original pixel-group be [99, 
1 10, 120, 130]. Then/= (99 * 1 + 1 10 * 2 + 120 * 3 + 130 * 4 ) 
mod 9 = 2, s = ( d -f) mod 9=1 which is less than n. Thus, g() 
= ^ + 1 = 100. 

Second example: let d=9 and the same pixel-group is 
employed. Then s = ( d -f) mod 9 = 7 which is bigger than n. 
thus, g( 2n+1 . s ) = g( 2 )- 1 = 109. 

The embedded data can be extracted using the following 
extraction function of stego-pixel-group (4). 

d = i(gl r g2 gn) = [Zf^gi x i)]mod (2ti+ 1) (4) 

Ref. [5] proposes the relationship between the value of n and 
amount of payload that minimizes cover image distortion. 
Here, the value of n is computed prior to embedding using 
(5). 



x[log 2 (2n + l)l >p 



(5) 



where / means the total number of pixels of the cover image, 
n indicates the amount of pixels for each group and p 
represents the payload to be embedded. 

IV. PROPOSED METHOD 

The main aim of this method is to hide a large amount of 
secret message in high secrecy and the same time preserves 
the cover image's quality. Here, a new 2-tier data hiding 
technique is proposed that involves the EMD method and 
the lossless Huffman coding. At first, the secret data is 
encrypted, compressed and transformed into a stream of bits. 
Then the bits are converted into secret digits by using the 
Huffman dictionary table. Subsequently, the cover image is 
segmented into groups of n pixels and each group is 
embedded with one secret digit by modifying one gray-scale 
value at most to hide the secret digit in (2n+l)-ary notational 
system. 

Stepl : Construct a table containing individual letters and 

their frequency numbers. 

Step2: Sort the letters in an ascending order according to 
their frequency numbers. 

Step 3: Add the first two frequency numbers and then re- 
arrange the table again. 

Step 4: Repeat step 3 until a single frequency number is 
achieved. 

Step 5: Construct Huffman tree by assigning each pair of 
branches with (0,1) for all branches of the tree. 

Step 6: Rewrite the letters according to the above 
Huffman tree. 
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Step 7: Construct the final table containing all the secret 

letters with their codes. 
Example, let the message be thisis 

Stepl and step 2 



Letter 


Frequency 


t 


1 


h 


1 


i 




3 




2 =i iteration 


Letter 


Frequency - 


is 


4 


u 


2 



Step 3. I s ' iteriticn 


Letter 


Frequency 


i 




S 




a 




3 rd iteration 


Letter 


Frequency 


i : 3 : th 


6 



5t*p 5 




•A 

O 



5t*p 6 



Figure 1. Huffman tree 
Hu frman tabl# 



5t*p 7 



L^ttsr 


code 


i 


0(00) 


S 


1(01) 


t 


2(10) 


h 


3(11) 



B. EMD Method 

This method hides each digit of the Huffman codes into 
the segmented cover image by increasing or decreasing by 
one gray-scale value at most for each group of n pixels [4]. 
Here, the value of n is crucial to determine the right size of the 
group. Therefore, (5) is used to calculate n which resulted in 
only minimum distortion for the cover image. Then (2) is 
used to transform each n-pixel group into a reference value/. 
Embedding algorithm 

Input: cover image (gl, g2, ,gn), digits from Huffman 

encoding d. 
Output: stego-image 
Stepl: Fori=l:n 

f=(sum(g. * i)) mod (2n+l) 

end 
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Step2 
Step3: 

Step4: 
Step5 



If f"'d then s-d-f mod(2?z+l) 
If s < n then increase g( s ) by 1 
if s > n then decrease g( 2n+1 . s ) by 1 
Repeat 1 to 4 until all the digits of the secret 



message are processed. 



S 5-ntial of bits 



Hu ft xd an cod-inE alEoiithm 




Figure 2. Flow chart for embedding process 
C. Extraction of The Secret Codes 

The stego-image is received and (4) is applied to extract 
the codes. Then these codes are converted into binaries using 
Huffman dictionary and finally the original message of plain 
text is deciphered using Huffman decoding. Fig. 3 depicts 
the extraction method. 
Extraction algorithm 
Input: Stego-image 
Output: Secret data 
Stepl: Fori=l:n 

J=(sum(g.' * i)) mod (2^+1) 
End 

Step2: Convert (d) to binary. 

Step3: Decode this binary number to original letter by 

using Huffman decoding. 
Step4: Repeat 1-3 until end of text. 




Hu f tinaii die ti onarv 

~~ I — 



decodiiiE 



jE--r?t data 



Figure 3. Flow chart for extraction process 

V. EXPERIMENTAL RESULTS 

An experiment is conducted using four standard gray- 
scale images of 5 12x5 12 pixels as cover images shown in Fig. 
4 namely, Baboon, Airplane, Lena and Tiffany Furthermore, 
an arbitrary plain text is used as the secret message. To 
evaluate the quality of stego-image, the following peak signal- 
to-noise ratio (PSNR) is employed: 



PSNR = 10 log 



2 = =^ 



10 MSE 



(6) 



Where MSE refers to the Mean Square Error between two 
pixels. 
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S -=i Ej= i( cover (i J) — stego(ij')') 2 ^ 

MSE = 

r X c 

Where r and c refer to the number of rows and columns 
respectively for the cover image and stego-image. 

As shown in Table 1 , for the purpose of a benchmark test 
between the proposed technique and various well-known 
methods namely, OPAP, EMD and Opt EMD, two different 
sizes of payloads are used. Thus, the experimental results are 
divided into two parts: The 1 st . part utilizes full payload, 
whereas the 2 nd . part uses half of it. The results have revealed 
that our proposed method is superior than the rest of the 
techniques in terms of both PSNR and payload viz. 60.31 dB 
and 36000 byte, respectively. The difference in terms of PSNR 
is quite significant between the proposed method and the 2 nd 
best method which is around 5 dB. As for the 2 nd . part of the 
results, although the payload is reduced to half, it is obvious 
that the performance of the proposed method is maintained 
in relations to both PSNR and payload. Therefore, the 
proposed method is capable of embedding large amount of 
secret messages without sacrificing image quality. 

TABLE I. 

THE RESULTS OF THE PROPOSED METHOD, OPAP, EMD AND OPT. EMD 



capadty 


3276S byte 


36000 byte 




OPAP 


EMD 
n=2 


Opt EMD 
n=4 


Our method 
n=4 


Cover 
Image 


PSNR 


PSNR 


PSNR 


PSNR 


Lena 


51 14 


53.36 


54.67 


60.23 


B aboon 


51 14 


53.S7 


54.67 


6029 


Airplme 


51 14 


53.36 


5466 


6041 


Tiffany 


51 14 


53.37 


5466 


60.27 


AVG 


51 14 


53.37 


5466 


6031 
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H53S4 byte 


13000 byte 




OPAP 


EMD 
n=2 


OptEUD 
n=9 


Our method 
n=9 


Cover 
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PSNR 


PSNR 


PSNR 
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5416 


56.33 


53.37 


63 01 


Baboon 


5415 


56.39 


58.36 


63 06 


Airplm? 


5416 


56.33 


58,36 


62.95 


Tiffany 


5415 


56.39 


53.33 


62.96 
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5416 


56.39 


53.37 


62.99 




{a)Tiffany 



(b) Baboon 




fc) Lena 



(d) Airplme 



Figure 4. Test images (a)~(d) 



CONCLOSIONS 



In this paper, a 2-tier data hiding technique using Huffman 
coding and EMD method is proposed. The secret messages 
are encrypted, compressed and embedded in cover images 
with high secrecy without compromising the quality of the 
stego-images. Experimental results have revealed that the 
proposed method has successfully outweighed the well- 
known methods in both PSNR and payload. The experiments 
would be more meaningful if attacks were performed to 
measure the robustness; however this is beyond our scope 
of study at this juncture. 
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